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Question 5 


Intent of Question 





The primary goal of this question was to assess students’ ability to identify, set up, perform, and interpret 
the results of an appropriate hypothesis test to address a particular question. More specific goals were to 
assess students’ ability to (1) state appropriate hypotheses: (2) identify the appropriate statistical test 
procedure and check appropriate conditions for inference; (3) calculate the appropriate test statistic and 
p-value; and (4) draw an appropriate conclusion, with justification, in the context of the study. 

Solution 

Step 1: States a correct pair of hypotheses. 


Let UUgig Tepresent the population mean difference in purchase price (woman — man) for identically 
equipped cars of the same model, sold to both men and women by the same dealer, in the county. 


The hypotheses to be tested are Ho : Mgig =O versus H, : gi > O. 


Step 2: Identifies a correct test procedure (by name or by formula) and checks appropriate conditions. 


The appropriate procedure is a paired t-test. 


q 





[he conditions for the paired t-test are: 
1. The sample is randomly selected from the population. 


The population of price differences (woman — man) is normally distributed, or the sample size is 
large. 





The first condition is met because the car models and the individuals were randomly selected. The 
sample size (n = 8) is not large, so we need to investigate whether it is reasonable to assume that the 
population of price differences is normally distributed. The dotplot of sample price differences reveals a 
fairly symmetric distribution, so we will consider the second condition to be met. 


Step 3: Correct mechanics, including the value of the test statistic and p-value (or rejection region). 








i Sate wian aod OW. -oO) 
he test statistic is t = 33071 * 3.12. 
V8 
The p-value, based on a t-distribution with 8 —-1= 7 degrees of freedom, is 0.008. 


Step 4: States a correct conclusion in the context of the study, using the result of the statistical test. 


Because the p-value is very small (for instance, smaller thana = 0.05), we reject the null hypothesis. 
The data provide convincing evidence that, on average, women pay more than men in the county for 
the same car model. 
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Question 5 (continued) 
Scoring 
Each of steps 1, 2, 3, and 4 were scored as essentially correct (EF), partially correct (P), or incorrect (I). 
Step 1 is scored as follows: 
Essentially correct (E) if the response identifies the correct parameter AND states correct hypotheses. 


Partially correct (P) if the response identifies the correct parameter OR states correct hypotheses, but 
not both. 


Incorrect (I) if the response does not meet the criteria for E or P. 


Note: Defining the parameter symbol in context or simply using common parameter notation is 
sufficient. 


Step 2 is scored as follows: 


Essentially correct (EF) if the response identifies the correct test procedure (by name or by formula) AND 
checks both conditions correctly. 





Partially correct (P) if the response correctly completes two of the three components (identification of 
procedure, check of randomness condition, check of normality condition). 


Incorrect (I) if the response does not meet the criteria for E or P. 


Note: The random sampling condition can be verified by referring to the random selection of car 
models or to the random selection of male and female car buyers. 


Step 3 is scored as follows: 
Essentially correct (E) if the response correctly calculates both the test statistic and the p-value. 
Partially correct (P) if the response correctly calculates the test statistic but not the p-value; 
OR 
if the response calculates the test statistic incorrectly but then calculates the correct p-value for the 
computed test statistic. 


Incorrect (I) if the response does not meet the criteria for E or P. 


Note: If the response identifies a z-test for a mean as the correct procedure in step 2, then the response 
can earn a P in step 3 if both the test statistic and the p-value are calculated correctly. 
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Question 5 (continued) 


Step 4 is scored as follows: 


Essentially correct (E) if the response provides a correct conclusion in context, also providing 
justification based on linkage between the p-value and the conclusion. 


Partially correct (P) if the response provides a correct conclusion with linkage to the p-value, but not in 
context; 

OR 
if the response provides a correct conclusion in context, but without justification based on linkage to 
the p-value. 


Incorrect (I) if the response does not meet the criteria for E or P. 


Notes: 


If the conclusion is consistent with an incorrect p-value from step 3 and also in context with 
justification based on linkage to the p-value, step 4 is scored as E. 
A response that performs a two-sample t-test with correct calculations should fail to reject Ho. 


A conclusion that is equivalent to “accept Hj,” (such as “we conclude that women pay the same 


amount as men, on average”), either as a stated decision or as a conclusion in context, cannot 
be scored as EF... Such a response will be scored as P provided that the conclusion is in context 
with linkage. Such a response will be scored as I if it lacks either context or linkage. 


Each essentially correct (E) step counts as 1 point. Each partially correct (P) step counts as % point. 


4 


3 


2 


Complete Response 
Substantial Response 
Developing Response 


Minimal Response 


If a response is between two scores (for example, 2% points), use a holistic approach to decide whether 
to score up or down, depending on the overall strength of the response and communication. 
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5. A researcher conducted a study to investigate whether local car dealers tend to charge women more than men for 
the same car model. Using information from the county tax collector’s records, the researcher randomly selected 
one man and one woman from among everyone who had purchased the same model of an identically equipped 
car from the same dealer. The process was repeated for a total of 8 randomly selected car models. 


The purchase prices and the differences (woman — man) are shown in the table below. Summary statistics are 
also shown. 
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Dotplots of the data and the differences are shown below. 
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Do the data provide convincing evidence that, on average, women pay more than men in the county for the same 
car model? 
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5. A researcher conducted a study to investigate whether local car dealers tend to charge women more than men for 
the same car model. Using information from the county tax collector’s records, the researcher randomly selected 
one man and one woman from among everyone who had purchased the same model of an identically equipped 

car from the same dealer. The process was repeated for a total of 8 randomly selected car models. 


The purchase prices and the differences (woman — man) are shown in the table below. Summary statistics are 
also shown. 
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Dotplots of the data and the differences are shown below. 
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5. A researcher conducted a study to investigate whether local car dealers tend to charge women more than men for 
the same car model. Using information from the county tax collector’s records, the researcher randomly selected 
one man and one woman from among everyone who had purchased the same model of an identically equipped 
car from the same dealer. The process was repeated for a total of 8 randomly selected car models. 


The purchase prices and the differences (woman — man) are shown in the table below. Summary statistics are 
also shown. 
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Dotplots of the data and the differences are shown below. 
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Do the data provide convincing evidence that, on average, women pay more than men in the county for the same - 
car model? 
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Question 5 
Overview 


The primary goal of this question was to assess students’ ability to identify, set up, perform, and interpret 
the results of an appropriate hypothesis test to address a particular question. More specific goals were to 
assess students’ ability to (1) state appropriate hypotheses; (2) identify the appropriate statistical test 
procedure and check appropriate conditions for inference; (3) calculate the appropriate test statistic and 
p-value; and (4) draw an appropriate conclusion, with justification, in the context of the study. 


Sample: 5A 
Score: 4 


In Step 1 the student states a correct pair of hypotheses and defines the parameter appropriately, so this 
step was scored as essentially correct. In Step 2 the response correctly identifies a “matched pairs t-test” 
as the appropriate inference procedure. Because the random and normal conditions are verified 
appropriately, this step was scored as essentially correct. (Note that the comments about independence 
were viewed as extraneous.) In Step 3 the student correctly calculates the test statistic and p-value with 
appropriate supporting work. As a result, this step was scored as essentially correct. In Step 4 the 
student's conclusion begins with a correct decision about the null hypothesis, with linkage to the p-value. 
This is followed by a correct contextual conclusion about the mean difference in purchase price for women 
and men. Consequently, Step 4 was scored as essentially correct. Because all four steps were scored as 
essentially correct, the response earned a score of 4. 


Sample: 5B 
Score: 3 


In Step 1 the student gives a reasonable description of the parameter and states a correct pair of 
hypotheses. As a result, this step was scored as essentially correct. In Step 2 the student identifies a 
matched pairs t-test as the appropriate procedure, and checks the random condition in an acceptable way. 
However, the student discusses all three dotplots when examining the normal condition. With two of the 
three required components satisfied, Step 2 was scored as partially correct. In Step 3 the student obtains the 


correct test statistic and p-value, but makes a notation error when referring to the sample mean as {,, SO 


this step was scored as partially correct. In Step 4 the conclusion begins with a correct decision about H,, 


with linkage to the p-value. This is followed by a contextual conclusion about women paying more than 
men that does not include the idea of on average. Due to this omission, Step 4 was scored as partially 
correct. Because one step was scored as essentially correct and three steps were scored as partially correct, 
this response was initially scored as a 2%. Looking holistically at the paper, the student demonstrates clear 
understanding of the process of performing a paired t-test. Therefore, this response earned a score of 3. 
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Question 5 (continued) 


Sample: 5C 
Score: 2 


In Step 1 the response begins by stating a pair of hypotheses with two parameters, which gives no indication 
of the paired data setting in this question. Because the parameter definitions are reasonable, Step 1 was 
scored as partially correct. In Step 2, the student identifies an incorrect procedure (two-sample t-test) and 
checks the normality condition using two normal probability plots, which is not appropriate for a paired 
t-test. Although the random condition is verified, with only one of the three required components present, 
Step 2 was scored as incorrect. In Step 3 the student provides a correct test statistic and p-value for a two- 
sample t-test, so this step was scored as essentially correct. In Step 4, the first two sentences of the 
conclusion include all of the required elements for an essentially correct response. However, the last sentence 
is equivalent to accepting the null hypothesis, so Step 4 was scored as partially correct. Because one step 
was scored as essentially correct, two steps were scored as partially correct, and one step was scored as 
incorrect, the response earned a score of 2. 
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